Improving speech recognition for children using acoustic adaptation and pronunciation modeling

نویسندگان

Prashanth Gurunath Shivakumar

Alexandros Potamianos

Sungbok Lee

Shrikanth S. Narayanan

چکیده

Developing a robust Automatic Speech Recognition (ASR) system for children is a challenging task because of increased variability in acoustic and linguistic correlates as function of young age. The acoustic variability is mainly due to the developmental changes associated with vocal tract growth. On the linguistic side, the variability is associated with limited knowledge of vocabulary, pronunciations and other linguistic constructs. This paper presents a preliminary study towards better acoustic modeling, pronunciation modeling and front-end processing for children’s speech. Results are presented as a function of age. Speaker adaptation significantly reduces mismatch and variability improving recognition results across age groups. In addition, introduction of pronunciation modeling shows promising performance improvements.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pronunciation and Acoustic Model Adaptation for Improving Multilingual Speech Recognition

In this paper, we address the importance of pronunciation and acoustic model adaptation in multilingual speech recognition. When aiming at modeling several languages simultaneously, the degree of speaker and language variability is even greater than when concentrating on only one language. To compensate the pronunciation variability across various speaker, bi-lingual pronunciation modeling is p...

متن کامل

Multilingual Pronunciat Improving Multilingual S

Multilinguality aspects are becoming increasingly important in the Automatic Speech Recognition (ASR) systems. It is apparent that coping with large variability of the speech signal is an even bigger challenge in multilingual ASR systems than it has been in conventional monolingual systems. In this paper, we address the importance of combining multilingual pronunciation modeling and acoustic mo...

متن کامل

Approaches to foreign-accented speaker-independent speech recognition

Current research in the area of foreign-accented speech recognition focusses either on acoustic model adaptation or speakerdependent pronunciation variation modeling. In this paper both approaches are applied in parallel and in a speaker-independent fashion: the acoustic modeling part is based on a derived Hidden Markov Model (HMM) clustering algorithm and the lexicon adaptation is based on spe...

متن کامل

Modeling Cantonese Pronunciation Variations for Large-Vocabulary Continuous Speech Recognition

This paper presents different methods of handling pronunciation variations in Cantonese large-vocabulary continuous speech recognition. In an LVCSR system, three knowledge sources are involved: a pronunciation lexicon, acoustic models and language models. In addition, a decoding algorithm is used to search for the most likely word sequence. Pronunciation variation can be handled by explicitly m...

متن کامل

Integration of MLLR adaptation with pronunciation proficiency adaptation for non-native speech recognition

To recognize non-native speech, larger acoustic/linguistic distortions must be handled adequately in acoustic modeling, language modeling, lexical modeling, and/or decoding strategy. In this paper, a novel method to enhance MLLR adaptation of acoustic models for non-native speech recognition is proposed. In the case of native speech recognition, MLLR speaker adaptation was successfully introduc...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2014

Improving speech recognition for children using acoustic adaptation and pronunciation modeling

نویسندگان

چکیده

منابع مشابه

Pronunciation and Acoustic Model Adaptation for Improving Multilingual Speech Recognition

Multilingual Pronunciat Improving Multilingual S

Approaches to foreign-accented speaker-independent speech recognition

Modeling Cantonese Pronunciation Variations for Large-Vocabulary Continuous Speech Recognition

Integration of MLLR adaptation with pronunciation proficiency adaptation for non-native speech recognition

عنوان ژورنال:

اشتراک گذاری